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Abstract 

The individualization-refinement paradigm for computing a canonical labeling and/or 
the automorphism group of a graph is investigated. New techniques are introduced with 
the aim of reducing the size of the associated search space. In particular, a new partition 
refinement algorithm is proposed, together with graph invariants having a global nature. 
Experimental results and comparisons with existing tools, such as nauty, reveal that the 
presented approach produces a huge contraction of the search space. Such reduction will be 
shown to be exponential for special classes of graphs which are intractable by nauty. 

1 Introduction 

When we consider the literature about methods for approaching the graph isomorphism problem 
(GI) or the related problem of canonical labeling of graphs, a peculiar panorama appears. 

From the theoretical side, besides papers substantiating the thesis that GI is not NP-complete 
[20, 26] (a survey is in [1]), there is a huge amount of notable pieces of mathematics showing 
the existence of polynomial solutions of GI, for significant classes of graphs. While moderately 
exponential solutions have been provided for the general problem of graph isomorphism [3, 6], 
polynomial algorithms exist for planar graphs [12, 13], graph of bounded genus [10], graphs with 
colored vertices and bounded color-classes [2], graphs with bounded multiplicity of eigenvalues 
[5], graphs of bounded valence [18], and more (see [4]). 

From the practical side, there are some notable pieces of software, which originate from the 
outstanding tool nauty [21]. nauty was introduced in the 80's by McKay [22]; it has become a 
standard in the area of canonical labeling and detection of the automorphism group of a graph; 
moreover, it has been incorporated into more general mathematical software tools such as GAP 
[11] and MAGMA [19]. 

It is important to observe that none of the polynomial algorithms, mentioned above, has been 
implemented in software, as noted by Junttila and Kaski in [15]. A reasonable justification for 
this absence seems to be that, usually, given a class C of graphs for which an efficient algorithm 
for isomorphism testing exists, nauty is able to treat almost all the graphs of C, still remaining 
(considerably, indeed) below the theoretically established bound. However, there might exist 
subsets of C for which nauty exhibits an exponential behavior, as it results from the series of 
graphs constructed by Miyazaki in [23]: all these graphs are 3- regular and have color-class size 
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equal to 4, hence they intersect two classes of graphs for which polynomial solutions for GI exist. 
A further weird feature of Miyazaki's graphs is that the size of their automorphism group is quite 
large, such size increasing with that of graphs. This contrasts with the general agreement that 
hard graphs for nauty have a high degree of regularity but a small automorphism group. 

In recent years, the tools saucy [9, 8] and hliss [15, 14] have been introduced, with the 
aim of handling large and sparse graphs coming either from the satisfiability problem (SAT) or 
from industrial applications. As well as nauty, these arc general purpose devices implementing 
backtrack algorithms based on the so called individualization-refinement technique and, though 
similar in nature, they differ from each other for the used data structures and heuristics 

The motivation of the present paper is in some sense orthogonal to that of saucy and hliss] 
while they provide more efficient data structures (some of them already adopted by the newest 
release of nauty) and heuristics to be placed into the precise structure of nauty, we start our inves- 
tigation from the analysis of some critical features of the rcfinement-individualization paradigm 
and we propose new solutions and a new algorithmic design. Our reference graphs are not large 
and sparse graphs with large automorphism groups, but rather all the known hard graphs for 
nauty, such as graphs built from Hadamard matrices, graphs coming from projective planes, 
other graphs of combinatorial origin and, obviously, Miyazaki's graphs. The aim is to make 
the computation for all these graphs feasible; the price we are ready to pay for this is just a 
reasonable loss of cfBcioncy in the cases of "easy" graphs. 

The complexity of a tool based on the individualization-refinement technique essentially comes 
out from the size of the associated search space, usually implemented as a tree, and from the 
complexity of the refinement function (which is invoked once for every node of the search tree) . 
On its hand, the behavior of the refinement procedure affects the size of the search tree, as 
nauty's vertex-invariants show. However, the choice of the correct invariant is left by nauty to 
the user, since the usefulness of one of them strictly depends on the input graph: in some sense, 
the adoption of a vertex invariant deviates from the initial assumption of a general purpose tool. 

Our aim is to propose a new approach in connection with three primary issues of the 
individualiza-tion-refinement paradigm for canonical labeling of graphs: the size and depth of the 
search tree, the refinement procedure, the selector of the next individualization step. We suggest 
a different structure for the refinement procedure, which will be called niulti-rcfi/acrncnt, whose 
corresponding invariant is not based on local properties of vertices of the graph (such as adja- 
cencies, incident cliques and others): two vertices will be considered equivalent (and therefore 
assigned to the same color class) when their individualizations induce (after refinement) equiv- 
alent partitions. This enables to reduce the depth of the search tree, obtaining finer partitions, 
and authorizes new criteria for selecting the color class of the next individualized vertices, in our 
case the one producing the partition with maximal size. 

Our multi-refinement is computationally much heavier than usual refinement. In the worst 
case, we could be losing a factor of n (the number of vertices of the considered graph) in time, 
when the refinements of all vertex individualizations are needed to produce the multi-refinement. 
However, from one hand some techniques will be introduced in order to manage the required 
refinements without computing them completely, including the use of information coming from 
the automorphism group of the graph and a new representation of the whole refinement process. 
Prom the other hand, multi-refinement naturally suggests its parallel execution, though this is 
outside the scope of the present paper. 

^We briefly recall here that the key of the individualization-refinement technique is the notion of equitable 
partition, a coloring of vertices of a graph such that any two vertices with the same color have the same amount 
of neighbors in each color class; this induces an equivalence relation. A vertex is individualized by assigning to 
it a fresh color, the so obtained coloring is refined when a new equitable partition (finer than the initial one) is 
produced. 
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On the positive side, multi-refinement will cause a considerable reduction of the size of the 
search tree: in particular, such decrease will be shown to be dramatic when very large search 
trees are needed by nauty and other tools; as an example, an exponential reduction of the size of 
the search tree will come out for Miyazaki's sequence of graphs. In all cases, multi-refinement, 
combined with the associated selector of the next individualization step, causes the reduction of 
the depth of the search tree. We observe that the contraction of one level of the search tree often 
pays back, in terms of time, the loss caused by multi-refinement. 

Within the mentioned motivations, we will introduce a tool named Traces, whose main con- 
tributions can be summarized as follows: Traces either computes a canonical form for a colored 
graph or a set of generators for its automorphism group; in the latter case, a method is defined 
and implemented, which can be applied to check isomorphism between two graphs Gi and G2 
without considering Gi U G2*-^\ with the motivation of speeding-up the computation of the au- 
tomorphism group in the case of "hard" graphs. Traces does not use backtrack to traverse the 
search space; at the implementation level, the latter is not even treated as a tree, though, for 
expository reasons, we will still refer to it as the search tree. In Traces, the next individualiza- 
tion step is chosen during the refinement procedure as the one which maximizes the size of the 
partition which will come after it, in order to define an individualization selector which does 
not leave vertices of the graph unconsidered as individualized ones. The refinement procedure of 
Traces produces a partition such that, for any cell W, the usual individualization and refinement 
of all elements of W induce the same quotient graph; moreover, partitions are compared (and 
possibly discarded) without computing them completely, using a linear representation which we 
will call trace. Detected automorphisms are manipulated in Traces by means of the Schreier-Sims 
algorithm [28] (sec also [27]); information on the group structure is also used by the refinement 
procedure to eliminate redundant computations. 

The choice of Schreier-Sims algorithm is due to the fact that it allows, by automorphism 
detection, the highest range of pruning of the search tree (sec [16]), which is the main concern 
of this paper. Leon's implementation [17] of the Schreier-Sims algorithm is integrated in Traces: 
such code is extremely efficient in the case of graphs which do not exhibit a very large automor- 
phism group, those we are mainly interested in. We will point out the cases where the use of the 
Schreier-Sims algorithm heavily affects the whole computation. 

We will produce results and performance tables using fragments of the huge catalogue of 
benchmark graphs for canonical labeling and automorphism group computation compiled by 
Junttila and Kaski [15]. In order to give an anticipation of our results and a final account on our 
motivations, we report some experiments, taken from Tables 1 and 2 and executed on an Apple 
Mac Pro with 2 Quad Core processors at 2.8 GHz: 

- pp-16-9 is a (546 vertices, 4641 edges) graph associated to one of the known projective 
planes of order 16 (see [25]): bliss, which is considerably faster than nauty on this graph, 
takes approximately one hour to canonize it, traversing more than 300 million nodes of the 
search tree, which has depth 6; Traces reduces the depth of the search tree down to 3 and 
traverses 187 nodes, only, in 0.39 seconds; the time spent in group computation is in this 
case negligible; the size of the automorphism group of the graph is 921, 600; 

- had-100 is a (400, 20200) graph associated to a 100 x 100 Hadamard matrix (see [29]): bliss 
takes 4.23 seconds to canonize had-100, traversing more than 50,000 nodes of the search 
tree, which has depth 5; Traces reduces the depth of the search tree down to 2 and traverses 
725 nodes in 3, 20 seconds; the time spent in group computation is in this case negligible, 
too; the size of the automorphism group of the graph is 400; 

^Recall that Mathon's proof [20] of the equivalence between the graph isomorphism problem for Gi and G2 
and the problem of computing generators of the automorphism group of a graph requires the computation of the 
automorphism group of Gi U G2 . 
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- while canonizing a complete graph, Traces spends almost all the time in group computation; 

- the sequence of graphs mz-aug2-2x derives from Miyazaki's construction; bliss traverses 
more than 1 million nodes to canonize mz-aug2-18 (a (432, 684) graph), more than 4 million 
nodes to canonize mz-aug2-20 (a (480, 760) graph), more than 16 million nodes to canonize 
mz-aug2-22 (a (528,836) graph), and so on. On its hand. Traces traverses 145 nodes to 
canonize nnz-aug2-18, 161 nodes to canonize mz-aug2-20, 177 nodes to canonize mz-aug2-22, 
a linear sequence. 

These are prototypical cases. They show that the difference in performance between Traces and 
other tools for canonical labeling of graphs is located in the ability to prune the search space. 
When, as in the first case, the ratio between sizes of the search spaces is substantial, then Traces 
runs much faster than other tools; otherwise, their timings arc comparable, modulo the time 
spent in the group-related computations. Traces is slower than other tools when the associated 
search spaces are very small, usually when the input graph has a low degree of regularity or a 
high degree of symmetry. Even better results will be shown in the case of automorphism group 
computation instead of canonical labeling. 

1.1 Structure of the paper 

In the next section, some properties of graphs and partitions will be introduced; the individualiza- 
tion-refinement technique will be described in section 2.1. Section 3 is dedicated to the presen- 
tation of the tool Traces, which implements our approach to canonical labeling of graphs. In 
particular, the multi-refinement operation will be defined, together with techniques for compar- 
ing partitions and choosing the next individualization step (section 3.1) and the strategy for 
traversing the associated search space (section 3.2); the method for computing the automor- 
phism group of a graph will be presented in section 3.3. Experimental results are presented and 
commented in section 4, in relation to nauty and hliss. A brief discussion and suggestions for 
further work conclude the paper. 

2 Graphs and partitions 

A (labeled simple) graph is a pair (V^E), where is a finite set of vertices and E \s a. set of 
unordered pairs of vertices called edges. If (u,w) S E, we say that v and w are adjacent or 
neighbors. A (vertex) colored graph is a pair G = {H, x), where H is a graph and x is a function 
assigning colors to vertices of H. A more precise definition of the coloring function will be given 
in (1). 

In this paper, [n] will denote the set {l,...,n}, while G[„] will denote the set of colored 
graphs with vertex set [n] . The whole set of colored graphs will be denoted by G. Note that any 
graph is a colored graph in which all vertices have the same color. 

Isomorphisms of graphs in G[„] are permutations of {1, ... , n} preserving adjacency as well 
as non adjacency. When considering graphs with colored vertices, isomorphisms must preserve 
colors, too. We will write Gi ~ G2 when Gi and G2 are isomorphic. An automorphism is an 
isomorphism between a graph and itself. The automorphism group Aut(G) of a graph G is the 
set of all automorphisms of G with permutation composition as group operation. 

Definition 2.1 (Invariant, canonical form). 

1. An (isomorphism) invariant on G is a function / from G to a set D such that 

Gi ~ G2 ^ /(Gi) = /(G2). 

2. A function C : G — > G is a canonical form iff 
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(i) VG G G : C{G) ~ G; 

(ii) VGi, G2 e G : Gi ^ G2 o C(Gi) = C(G2). 

Definition 2.2 (Ordered partition). An ordered partition of [n] is a sequence tt = {Wx, . . . , Wr) 
of disjoint non-empty subsets of [n], called cells, whose union is [n]. The set of ordered partitions 
of [n] will be denoted by !![„], while 11 will denote the set of all ordered partitions. 

A cell of a partition it G !![„] is trivial when it contains only one element. The partition tt is 
discrete if all its cells are trivial; tt is the unit partition when it is constituted by only one cell, 
i.e. TT = {[n]). For any tt € n[„] and v, w, G [n], we will write u ~7r w when v and t« belong to the 
same cell of tt. 

Definition 2.3 (Index, position). 

1. The index ind{v, tt) of a vertex v G [n] in an ordered partition tt G n[„] is the index of the 
cell of TT in which v appears, namely ind{v, {Wi, . . . , Wr)) = k when v G Wk- 

2. The position of a node v G [n] in an ordered partition tt G n[„] is defined by means of the 
function pos : [n] x n[„] — > [n] such that 

fe-i 

ind{v, {Wu...,Wr)) = k^ pos{v, (VFi, . . . , W^)) = 1 + ^ | | . 

i=l 

3. The position of a cell in an ordered partition tt is defined as the position of an element of 
W iiiTT (indeed, all the elements of W share the same position in tt) ; with some overloading; 

pos {W, tt) = pos {v, tt), for any v G W. 

As an example, if tt = ({2, 3}, {5}, {1, 4}) G Hp] , then 

pos (2, tt) = pos (3, tt) = 1 = pos ({2, 3}, tt) 

pos (5, tt) = 3 = pos ({5}, tt) 
pos (1, tt) = pos (4, tt) = 4 = pos ({1, 4}, tt) 

In particular, pos {v,{[n])) = 1, for any v € [n]. 

Definition 2.4. Let = (J =4n, where the relation =<;„ (C x Il[n]) is defined as follows: 

n 

TTl =4n T^2 ^ Vu G [n] I p0s{v,TTl) ^ pOS (w,7r2). 

If, TTi ^772, we say that tti is finer than 712 (and that tt2 is coarser that tti). 

Let TT G n[„] and let {xii ■ ■ ■ ^Xn) be a sequence of colors. The graph G = {{[n], E),tt) is 
a colored graph if we interpret the partition tt as a function assigning the pos {v, 7r)-th color 
Xpos{v,7r) to the vertex v of G. Up to renaming of colors, the converse is also true. Therefore in 
the rest of the paper we will assume a colored graph to be a pair 

G={{[n],E),TT), (1) 

where tt is an ordered partition of [n]. Note that, with the present assumption, the ordered 
partition tt induces an ordering over colors of the graph G. 
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2.1 The individualization-refinement technique for canonical labehng 

In this section, the behavior of the individuahzation-refinement technique for canonical labeling 
is revisited. 

Definition 2.5 (Individualization). Let n — {Wi, . . . , Wr) € Il[n] be a partition of [n]. For any 
V E [n], if V belongs to a not trivial cell Wi, then we denote by 7r|w the partition obtained from 
TT by splitting the cell Wi into the cells {v} and Wi ~ {v}, namely 

7:lv = (VFi, . . . , M, W, - M, W,+i, ...,Wr), live W,. 

If G = {{[n], E),it) G G[„], then we say that G' — {{[n],E),TT[v) is obtained from G by individu- 
alizing the vertex v. 

Note that 

, I N f po.s (ui, tt) + 1 iiwEWi-{v}, 

P0S{W, TTiv) = \ ) { ^.u ■ 

pos(w,TT) otherwise. 




Definition 2.6 (Equitable graph). Let G = {{V,E),Tr) S G be a colored graph. We say that 
G is equitable when for every cell W of tt and for any two vertices v,w € W, v and w have an 
equal number of neighbors in each cell of tt. 

Definition 2.7 (Quotient graph). Let G = {{V,E),tt) G G be an equitable graph. The quotient 
graph of G, notation Q{G) ~ {V , E'), is a graph, with possible multiple edges and loops, having 
vertex set V = {pos (w, tt) | w S V} and edge multiset E' = ^{pos (w, tt), pos (w, tt)) | (w, w) € E^. 

Figure 1 shows an equitable graph G and its quotient graph. For every color appearing in G, 
there exists a corresponding vertex in Q{G); an edge in Q{G) has multiphcity A: if G has k edges 
joining vertices with the corresponding colors. Observe that, if we ignore edge multiplicities and 
loops, then the quotient graph Q{G) is isomorphic to a subgraph of the graph G^^\ Clearly, 
when the coloring partition tt is discrete, then Q{G) is isomorphic to the graph G itself. 

Definition 2.8 (Refinement). Let G = {H,7r) G G. A refinement of G is a function R : 
G ^ G such that: (i) ^'^^R{H,-k) = {H,-k') is an equitable graph with tt' ==; tt; (ii) R preserves 
isomorphisms, i.e. {Hi,ni) ~ {H2,tt2) ^ R{Hi,tti) ~ R{H2,n2)- 

^In the case of singleton vertices, this enables the partial leaf certificate of [15]. 

*By abuse of notation, while considering, for some W, a function F : G —> W, we will write F(H,it) instead of 
F({H, tt)) in order to avoid duplicated parentheses. 
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The refinement procedures implemented in nauty, saucy and bliss can be briefly described 
as follows: using a specific scheduling, which is isomorphism invariant, cells of tt are visited. 
For each visited cell W, the multiplicities of adjacencies of elements of W are counted. For 
any cell Z, the elements of Z are rearranged according to these values, and Z is split into the 
cells Zi, . . . , Zk such that all the elements of Zi have the same amount of neighbors in W. This 
process is repeated until the graph is equitable. 

Definition 2.9 (Target cell). Let G — {H,tt) G G. A target cell (selector) of G is a function 
T : G [n] such that: (i) T{H,tt) = k is the position of a cell of tt; (ii) T is an isomorphism 
invariant, i.e. (-ffi,7ri) ~ (112, 112) => T{Hi,7Ti) = T{H2,n2). 

The target cell selector used in nauty, saucy and bliss has a static nature: cells are classified 
according to the presence of edges and non-edges wrt other cells. The leftmost cell having the 
highest value in such classification is chosen as the target cell. 

The target cell selector of the present tool has a dynamic nature and is embodied into the re- 
finement procedure, which will be presented in Section 3. Prior to its definition, a brief description 
of the behavior of algorithms for graph canonization based on the individualization-refinement 
scheme is needed. 

Notation 2.10. Given an equitable graph G{H, tt) and a vertex v of H, we win write (-ff,7r(^)) 
as a shorthand for the refinement of the graph obtained by individualizing v: 

The search tree of an equitable graph G = [H, tt) is a labeled tree T{H, tt) such that 

- the root of T{H, tt) has label tt; 

- if TT is discrete, then the tree rooted at it consists of that single node; 

- otherwise, let tt = [Wi, . . . , Wi, . . . , Wu) and let Wi = {vi, . . . , vu} be the target cell of 
{H,tt). Then the tree rooted at tt has as children the trees r(iJ, Tr'^'i)), . . . , r(iJ, tt^'"-') 
and, for i — 1, .... h, the edge joining tt with T(iJ, tt'"')) has label Vi. 




({4}, {5}, {6}, {3}, {2}, {!}) ({4}, {6}, {5}, {2}, {3}, {1}) ({5}, {4}, {6}, {3}, {1}, {2}) 

Figure 2: The pruned search tree; at the leaves, graphs and their quotients coincide 

The typical behavior of algorithms based on the individualization-refinement mechanism is 
exemplified by the backtrack search in Figure 2, which is borrowed from nauty^s user guide [21]. 

Given an equitable colored graph G = {H, tt), the root of the tree is labeled by the partition tt. 
The target cell is here the underlined one, namely {4, 5, 6}. Any node of the tree is labeled by an 
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equitable partition which is obtained by its father by individuahzing each vertex in the target cell 
and then refining the obtained partition. The individualized vertices appear as edge labels in the 
tree. When the leftmost leaf is reached, the corresponding quotient graph is stored as a potential 
canonical form Cq for G. When the next leaf is reached, its corresponding quotient graph C'q 
is compared with Cg- If they are equal, an automorphism of G has been found. Otherwise, C'q 
is either discarded or it substitutes Cg if it is "better" according to some predefined ordering. 
Only part of the whole backtrack tree is actually generated. The other parts of the tree are 
either shown to be equivalent to parts already generated, or are pruned by means of invariant 
information discovered while traversing the tree itself. As an example, in Figure 2, the vertex 
6, which belongs to the target cell of the root, is not individualized, since the first two visited 
leaves identify the permutation (2, 3) (5, 6) as an automorphism of G. This implies that the tree 
obtained by individualizing the vertex 6 is isomorphic to the one individualizing the vertex 5: 
therefore it does not carry any additional information. 

3 Introducing Traces 

We start by introducing the refinement procedure of Traces, which is based on the following 

notion: 

Definition 3.1. Given G = {H,Tr) € G, the partition tt respects node individualizations when, 
for every pair of vertices v and w of G, 

v^^w^ QiH,Tr<-^^) = Q(ir,7r("')), 

i.e., if V and w belong to the same cell of tt, then the refinements of their individualizations 
induce the same quotient graph. 

Definition 3.2 (Multi-refinement). A rn,uHi-refinement is any refinement R : G Q such that 

R{H,tt) = {H,n') where tt' respects node individualizations. 

Traces uses the multi-refinement procedure defined by Algorithm 1. Starting from the equi- 
table partition tt, and for any cell W of tt, vertex individualizations (followed by refinements) of 
elements of W are computed. We obtain quotient graphs and we lexicographically order them; 
the cell W is split (if possible) according to such ordering. This process is repeated until no 
further splitting can be produced. If the individualizations-refinements of the elements of a sub- 
set W of W give discrete partitions, then the value of R{H, tt) is directly produced by taking 
the discrete partition which gives the lexicographically smallest quotient graph. Note that any 
discrete partition vacuously respects individualizations. 

Remark 3.3. (i) If we replace, in the statement marked by (★) of Algorithm 1, the function 
call Refine by MultiRefine, then we obtain a recursive algorithm which computes the orbit 
coloring of the input graph; 

(ii) for each orbit of the automorphism subgroup computed so far, only one vertex is indi- 
vidualized; singleton cells are obviously not considered for individualization; when some new 
automorphism is found, it is added as a new generator for the automorphism group of the input 
graph; the new orbits of graphs coming from individualizations are computed as those of the 
stabilizers of the individualized vertices. All these group computations are made possible in 
Traces using Leon's implementation [17] of the Schreier-Sims algorithm [28, 27]; 

(iii) multi-refinements allow for decreasing the depth of the search tree: from one hand, when 
they produce finer partitions wrt simple refinements; from the other hand, when they reach 
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Algorithm 1 Multi-refinement 



MULTlREFINE(iJ, Tt) 

Input: (H, tt) e G and S C Aut{H, tt) 
repeat 
{ 

let {H, {Wi,..., Wk)) = REFiNE(i?, tt) 
let = set of representants of orbits of S 
i = 1 

discrete = FALSE 
split = FALSE 
while {i < k) do 
if {\Wi\ > 1) then 
{ 

for (w e Wi n n) do 

{ 

p[w\ = it' such that REFINE(i?, lNDIVIDUALIZE(7r, w)) = {H, tt') (★) 

if {p[w] is discrete) then 
discrete = TRUE 

} 

if (discrete) then 
{ 

Check for Automorphisms between elements of p and update S 
return {H, tt) where w is the discrete partition in p which induces 
the lexicographically smallest quotient graph 

} 

else 
{ 

Split (if possible) the cell Wi into Wi^i, . . . , Wi^h according to the ordering 

of the quotient graphs induced by elements of the array p 
if {Wi has been split) then 
{ 

TT = (Wl, . . . , W_l, W,l , . . . , Wi,h, Wi+l,..., Wk) 

split = TRUE 

break the while loop 

} 

} 

i = i+l 

} 

} 

until (not split) 
return (.ff, tt) 
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a discrete partition while individualizing some vertex, thus adopting it as their result; it is 
important to note that such choice is isomorphism invariant; 

(iv) in splitting a cell, quotient graphs are evaluated for comparing partitions. A method allowing 
to differentiate partitions without computing them completely will be presented in section 3.1. 

As a general remark about the nature of multi-refinements, we observe that, being based on 
equivalence classes of refinements of all possible individualizations, they carry out information 
about the whole graph. This is a clear difference with respect to vertex invariants, which have 
an essentially local character. 




(F,7r(3)) (i^,7rW) (F,7r(5)) Multi-refinement 

iv V vi vii 



Figure 3: Multi-refining the Fiirer gadget in the first Miyazaki's graph 

In order to give a flavor of the behavior of the multi-refinement procedure, and to realize 
that Miyazaki's construction is not critical for Traces, we show in Figure 3 the simplest graph in 
Miyazaki's series. The graph and its coloring appear in Figure 3.i. We focus on the right part of 
the graph, only. For this graph F, first introduced by Fiirer (see [7]), we show the refinements 
obtained after individualization of the vertices 1 to 5, in Figure 3.ii to vi, respectively. It comes 
out that (i^, tt'^-'^^) and (i^, tt^^^^) have the same quotient graph, as well as (i^, tt*^"^^) and (i^, tt^^^^). 

tt'^'*^), the quotient graph of (F, ttI^^^), however, is different from Q(F, tt^^^), since the vertex 
9 has the same color in both graphs, but different colors for neighbors. Therefore, the cell 
{3,4,5,6} is split, and this also causes the splitting of the cell {1,2}. The multi-refined graph 
is reported in Figure 3. vii. Any cell will produce at the next step a discrete partition after the 
individualization of its vertices, for this fragment of the whole graph. We conclude this section 
observing that for any colored graph in Miyazaki's sequence [23], the initial multi-refinement is 
able to produce the orbit partition. 

3.1 Comparing refinements and multi-refinements 

The quotient graph of a refined graph is an isomorphism invariant of G, otherwise contradicting 
(ii) of Def.2.8. Therefore, given colored graphs Gi — {Hi,tti) and G2 — {H2,tt2), we have been 
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allowed (in Algorithm 1) to classify their refinements by comparing the induced quotient graphs 
Q{R{Gi)) and Q{R{G2))- If they are different, then there is no isomorphism between G\ and 
G2- In particular, if G\ and G^ are different colorings of the same graph H, during the search 
for a canonical form of H we can discard either G\ or G2 according to a predefined ordering on 
quotient graphs, e.g. the lexicographic one. 

However, refinements can be compared without computing them completely. This has been 
already observed by Junttila and Kaski in [15] in the case of singleton cells emerging during the 
refinement process (the consequent invariant is called partial leaf certificate^ and is adopted in 
Traces, too). In addition to this, we introduce and implement a different invariant which we will 
name refinement trace. 

Assume that the cell W of the partition tt is split during the refinement process into W and 
W" . This splitting gives the partition tt' such that: 

pos {W', tt') = pos {W, tt) and k = pos {W" , tt') = pos {W, tt)+ \ W' \ . 

The new position k created by splitting the cell W is a trace element of the refinement process. 
The refinement trace is the sequence of trace elements successively introduced during refinement; 
it is isomorphism invariant and can be stored into an array. Moreover, let us consider the 
alternation of individualization and refinement steps which is needed to compute a discrete 
partition; the whole process has its own trace, since the individualization operation consists of 
a cell splitting; too. Such trace has length at most n, and each of its elements appears exactly 
once in it. Note that, being sequences of integers, traces can be ordered, e.g. component-wise. 

Assume now that the refinement of Gi = (Hi, in) has been computed and r = (n, . . . , Tm) is 
its trace. While computing the refinement of G2 = (-ff2, ^^2) wc can stop as soon as we find a trace 
element r^' different from the corresponding Tj. This is sufficient to realize that the refinements 
of Gi and G2 are different. Obviously, if we choose an ordering on traces and r/ is "better" than 
Ti, than the refinement of G2 will be completed and its trace will be stored for later comparisons. 

Multi-refinements are classified by means of their traces, too. In this case, the sequence 
which contains the new positions created by the splitting cell is initially recorded, and then 
expanded with the sequence originating from calling the refinement procedure (first statement 
of the repeat loop in Algorithm 1) on the updated partition. 

3.1.1 The target cell 

A serious motivation for introducing multi-rcfincmcnts is that they define a family of target cell 
selectors, each of which is based on information coming from the next individualization step. 

Let {H, tt) be a colored graph. During the construction of its multi-refinement {H, tt'), we keep 
information about the sizes of partitions deriving from refinements of vertex individualizations, 
and then we choose as target cell the leftmost one in tt' which will subsequently produce the 
partition with the maximal size. This choice is isomorphism invariant. It comes out that multi- 
refinement allows the cell selector to behave in a fair way: a cell is never left unconsidered for 
being the one from which the next individualizations will come out. This appears as a major 
property if we want to avoid inefficiencies coming from the wrong choice of the target cell. 

3.2 Traversing the secirch tree 

For the sake of readability, we will use a simplified visual representation of a search tree, in which 
the root is labeled by a partition and any other node is labeled by the vertex of the graph which 
is individualized at that stage of the computation. Therefore, if a node is labeled by a vertex v, 
the intended meaning is that (i) v is an element of the target cell of the partition tt represented 
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Figure 4: First iteration 



by its father; (ii) the node v represents the partition tt''^\ which is obtained by applying the 
multi-refinement function. 

Traces does not perform backtrack on the search tree. The tree will be visited and pruned 
according to a strategy which can be viewed as an iterated depth- first visit of a forest of AND/OR 
trees. In such trees, nodes are discriminated between AND nodes and OR nodes. AND nodes are 
those which require all their children to be visited, while only one child is visited when starting 
from an OR node. The tree is visited by level iterations; at the i-th iteration, the forest of trees 
rooted at level i is considered, together with the assumption that the roots of such trees are 
AND nodes, while all other nodes are OR nodes. The description of such strategy will be now 
detailed as an abstract example, with the help of some figures. 

First iteration: Given the colored graph (iJ, ttq) and the multi-refinement R, let tt be the 
root of the search tree, where R{H, ttq) = {H, tt) . The multi-refinement procedure determines the 
target cell for tt, which we assume here to be {oi, 02, as, 04, a^}. We start by individualizing oi 
and multi-refining (iJ, 7rJ,ai), thus obtaining the first child of n. As mentioned above, the label 
ai in the tree is a placeholder for the partition Tr^°'^\ Now, we proceed down the tree with the 
aim of computing a discrete partition. Hence, let 61 be an element of the target cell of tti = tt^"^) 
and ci an element of the target cell of 772 = 'k^^\ 

Let us assume that tts ~ 7^2''^^ is discrete. By the definition of multi-refinement, all possible 
individualizations of vertices (except for those belonging to singleton cells in 772) have been 
performed, cell by cell, to produce 773 from 7r2ici. It may be the case, as we now assume, 
that a pair of such individualizations determine an automorphism 70 of the graph H, as from 
Algorithm 1. Clearly, such automorphism fixes ai, 61 and ci, while it moves some other vertices. 
In particular, assume that 70(02) = as- This implies that the subtree rooted at 03 can be pruned 
from the search tree, since it is isomorphic to the subtree rooted at 0,2. 

A path from the root of the search tree to a leaf is identified by the sequence of individualized 
vertices, and will be called an attempt. Therefore, (ai, 61, ci) is an attempt. The discrete partition 
TTs induces a candidate canonical form, which we store for later use. 

We now go back to a2 and we start a new path toward a leaf. When the new attempt 
(02, di,ei) has been completed, we compare the induced candidate canonical form with the one 
previously stored. If they are different, we discard one of them according to their lexicographic 
ordering, otherwise a new automorphism 71 has been found, such that 71 (ai) = 02,71(61) = 
c^i,7i(ci) = ei. This implies that the whole subtree rooted at 02 is isomorphic to the one rooted 
at fli, and it can be discarded. 

We now go back to 04, we compute the corresponding attempt and we assume that it does 
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not give any further information. So we consider the final node of the first iteration, 05. Here 
we want to describe the pruning operation which occurs when tt*-"^^ induces a quotient graph 
different from those already computed at the same level of the tree, say ■k^°'^\ This is clearly a 
node invariant. In this case we prune one of the subtrees, according again to their lexicographic 
ordering. In the previous figure, we have drawn a black circle around nodes pruned this way, 
while a white node is intended to be pruned by automorphism discovery. 




Figure 5: Second iteration (left); final iteration (right) 



Second iteration: Due to the pruning performed during the previous iteration, we are left 
with the nodes oi and 04 to be expanded. So we start from the tree rooted at ai and we consider 
the corresponding target cell. We use the information on the subgroup F of the automorphism 
group of H obtained so far. We compute the orbits of the stabilizer of F with respect to oi and 
we consider one representant for each orbit which intersects the target cell. Let 61,62,63 the 
vertices resulting from such filtering. We proceed exactly as in the previous cases, computing 
the required attempts and collecting the corresponding information. In the present figure, while 
comparing the candidate canonical forms deriving from the attempts (ai, 63, hi) and (04, /i, gi) 
an automorphism 72 such that 72(01) = 04,72(63) = /i,72(/ii) = 91 has been found, and the 
tree has been consequently pruned at node 04. 

Final iteration: We proceed exactly as in the previous iteration, first considering the 
stabilizer of the subgroup Fi of the automorphism group of H which has already been computed, 
now with respect to a pair of vertices. In the present figure, the attempts ending in Ci and C2 
have been discarded by detecting a Setter candidate deriving from the attempt (oi, 63, hi), while 
an automorphism has been detected which fixes oi and 63 and associates hi to /12. It comes out 
that the whole automorphism group of H has been computed, together with its generators, and 
the canonical form of H is the one induced by the attempt (oi, 63, hi). 

Summarizing, the search tree is visited by levels. For every node appearing at some level, 
that node is either discarded (by automorphism or by a node invariant criterion) or a path from 
it to a leaf is computed. The remaining candidate is taken as the canonical form of the input 
graph. As a final remark, we observe that the structure of the visiting strategy hints to the 
parallel implementation of all attempts originating at the same level. 

3.3 A VEiriant: computing the automorphism group 

As a final contribution, we describe the algorithm which has been implemented in Traces for 
computing generators of the automorphism group of a graph. 

Assume, for the sake of simplicity, that all discrete partitions appear at the same level £ in 



13 



the search tree associated to the colored graph G = {H, n) G G, as shown in Figure 6. left. The 
extension to the general case is straightforward. 

The algorithm proceeds as in the case of canonical labeling, down to the {i — l)-th level 
of the search tree. When the {£ — l)-th iteration of the algorithm has ended, we are left with 
the partitions tti, 7r2, • . • , tt^ to be the object of vertex individualizations. At this stage of the 
computation, in order to obtain the automorphism group of the initial graph, we proceed as 
follows: 

T{H,ir)=o^ T{H', 7r')=o 




Figure 6: Computing the automorphism group and isomorphism testing 



Procedure A. Let {H, tt) e G and let tti, . . . , TTfe be the roots of subtrees appearing at level i — 1 
in T{H.Ti), whose leaves are at level t. Let $ C KvLt{H) be the subgroup of the automorphism 
group of H computed so far. 

Step 1 Compute, in the usual way, individualizations of vertices in the target cell of tti and 
store the permutations corresponding to the obtained discrete partitions; update $ if new 
generators for Aut(if) are found; 

Step 2 For i = 2, . . . , fc, individualize only one vertex in the target cell of TTj, then refine the 

partition; compare the resulting permutation with the stored ones, in search for possible 
new automorphisms, and then store it on its turn, if no new automorphism is found. 

At the end of the process described in Procedure A, the whole automorphism group of the 
initial graph has been computed. 

Proposition 3.4. Procedure A returns the (generators of the) automorphism group of the graph 
H. 

Sketch of the proof. Let = [vi^ , ■ ■ ■ , Vit-i) be the sequence of vertices which have been indi- 
vidualized from the root of the tree down to tti ; if a new automorphism 7 fixes K-i , then it is 
detected during Step 1 of Procedure A. Otherwise, if, for some i, the tree rooted at tti is isomor- 
phic to the one rooted at tti , then the unique leaf computed from tt^ is sufficient to determine the 
mapping between the two trees, adding such mapping to $. At the end of Step 2 of Procedure 
A, all subtrees isomorphic to n are pruned, hence all the elements of Aut(i?) which do not fix 
V^i have been generated. □ 

Permutations stored by the application of Procedure A can be used for isomorphism testing. 
In effect, suppose that we are given another graph G' = {H',Tr') € G, as in Figure 6. right. If 
G and G' are isomorphic, then one of the trees rooted at level £ — 1 in T{H',Tr'), say the one 
rooted at tt- in Figure 6, must be isomorphic to the tree rooted at tti in T{H,Tr). Hence the 
unique permutation computed by individualizing one vertex of H' in tt- induces one isomorphism 
between G and G' with at least one of the stored permutation originating from tti. 

We observe that: (i) the breadth-first structure of the visiting strategy is particularly well- 
suited for the method described above; (ii) at the cost of storing permutations, GI can be tested 
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in a more efficient way without computing the automorphism group of the union of the graphs; 
(iii) the method can be apphed at any level of the tree; (iv) the final level of the search tree 
is often the one where refinement becomes heavy, since all regularities of the initial graphs are 
broken. As we will see from experimental results, this will make graph isomorphism testing 
feasible also in cases of very hard graphs. 

4 Experimental results 

The algorithm presented in the paper is now compared with nauty and bliss. A comparison 
with saucy can be obtained from our performance tables and those presented in [15]. Graphs 
are selected from the library of benchmarks which is attached to the hliss distribution ([14]). In 
order to obtain information on sizes of search trees, it has been decided not to time out, whenever 
reasonably possible, any experiment. As a consequence, only a few graphs have been considered, 
and, for each of them, only one instance. However, this seems to be sufficient to obtain a clear 
comparison between the different approaches. Experiments have been carried out on a Apple 
Mac Pro with 2 Quad Core processors at 2.8 GHz, under gcc 4.0. For each experiment, the 
following information is reported in Tables 1 and 2: (1) the name of the graph, (2) the number 
of its vertices and edges, (3) the size of the automorphism group of the graph, (4) the number of 
its orbits; for nauty and hliss, the execution time (in seconds) and the size and maximal level of 
the associated search tree (column Refine {(?) , considered as the amount of calls of the refinement 
function). For Traces, the following information is reported; (i) the execution time (in seconds), 
(ii) the amount of calls to the multi-refinement function (which gives the measure of the search 
space) together with the amount of individualizations needed to compute a discrete partition 
(which indicates the depth of the search space) , (iii) the number of computed generators of the 
automorphism group, (iv) the time spent in managing the group. In the case of the computation 
of the automorphism group (without searching for the canonical form) , the number of residual 
permutations needed for isomorphism testing is reported in the rightmost column. 

With reference to the classification in the mentioned graph library, we have selected graphs 
from the following families: 

- affine and projective geometries: graphs ag2-x, pg2-32; 

- Cai-Fiirer-Immerman construction: graphs cfi-x; 

- constraint satisfaction problems: graphs difp-20-0, fpga-x-y, s3-3-3-3, urq8-5; 

- Hadamard matrices: graphs had-x, had-sw-x (with some switching operations); 

- Miyazaki constructions: graphs mz-x, mz-aug-x, mz-aug2-x; 

- projective planes: graphs ppl6-x; taken from [24]: flag-6; 

- random regular graphs: graphs rnd-3-reg-x-y; 

- strongly regular graphs: graphs latin-x, latin-sw-x-y, lattice-30, sts-x, sts-sw-x-y; 

- complete graphs: graphs k-x: 

- grid graphs: graphs grid-x-y, grid-w-x-y. 

We provide two tables of results: in Table 1, graphs having large search trees in nauty and 
bliss are considered, while in Table 2 experiments with graphs having small search trees are 
reported. 

Table 1: graphs w^ith large search space. For all the considered graphs, Traces exhibits 
a drastic decrease for the size of their search space, with clear consequences over computation 
time. It comes out that the hardest instances are graphs with small automorphism group, such 
as had-236. The gain in performance of Traces wrt nauty and bliss is considerable for all graphs 
in the pp-16 family, with the addition of flag-6 (taken from Moorhouse's library [24]), which 
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comes from a projective plane of order 27. 

The reader can verify the exponential contraction of the search space of Traces wrt the ones 
of nauty and bliss in the case of Miyazaki's sequence mz-aug2, also observing that timings include 
group computation. 

Concerning the depth of the search space, its contraction is evident in all cases. 

In addition, Traces seems to have a more stable behavior on different instances of graphs in 
the same family (the required multi-refinements and computation time being related to the sizes 
of the input graph and of its automorphism group) , and also on different representations of the 
same graph, as shown by the following Table, where the canonical forms of graphs coming from 
random permutations of veritces of pp-16-8, a projective plane of order 16, are computed. 





bliss (0.35) - can 

Canon, time Refinements {() 


Traces - can 

Canon, time Multi-rcfs (ind) 


pp-16-8 


197.71 16589102 (5) 


0.42 139 (3) 


permj^ 


3663.95 311919402 (5) 


0.44 162 (3) 


perm2 


1765.20 150066473 (5) 


0.40 137 (3) 


perm3 


8183.51 704831435 (5) 


0.50 167 (3) 


perm4 


2938.24 254399812 (5) 


0.45 155 (3) 


permg 


5360.36 456661795 (5) 


0.53 169 (3) 



It comes out that Traces has a stronger ability of capturing the structure of the graph, thus 
abstracting from its representation. A further evidence of such claim can be also deduced from 
the series cfi-x, where the depth of Traces' search space is equal to the main parameter introduced 
by Cai,Fiirer and Immerman in their construction [7] . 

Table 2: graphs with small search space. Graphs exhibiting a small search space (wrt the 
size of their vertex set) come out to have either a large automorphism group or a trivial one. In 
the first case, the search space is massively pruned by automorphisms, in the second case just a 
few individualization steps arc needed to obtain discrete partitions. These are the most favorable 
situations for the individualization-refinement technique. Still Traces is able to reduce both the 
depth and size of search space. However, it is clear that the overhead determined by multiple 
refinements causes a (sometimes considerable) loss of performance. The most critical experiments 
for Traces are very sparse graphs (s3-3-3-3 and grid graphs): we observe that, in such graphs, 
partitions induced by simple refinement are equal to those deriving from multi-refinement, and 
this happens for every individualization step. Therefore multi-refinements are not able to get 
any additional information. As expected, in these cases, the ratio between the execution time 
of Traces and nauty (or bliss) can reach the number of vertices of the input graph. Comparing 
these results, however, we must notice that Traces^ beyond representing graphs by means of 
adjacency lists, does not implement any other technique for handling large and sparse graphs, 
yet. 

Table 2 shows that the time spent in group computation becomes significant when dealing 
with very large automorphism groups: this is evident in the case of complete graphs. 



16 



Graph 


(V,E) 


|Auti 


Orb 


nauty 1 
Time 


2.2) - aut 

Refine (J:) 


bliss (0.35) - can 
Time Refine {(!.) 


bliss (0.35) - aut 
Time Refine 




iViii.et t^mci J 




Grp 


Time 


Traces 
MRef 


Grp 


Rad 


had-52 


(208. 5512) 


24-13 


2 


10-73 


190034 (5) 


0.50 


14536 (4) 


0.50 


15533 


0_26_ 


287 (2) 


^ 


""^ 


0.15 


128 


0.01 




had-100 


(400, 20200) 


2^.52 


2 


185.00 


1348634 (5) 


4.23 


52819 (4) 


4.43 


58613 


3.20 


725 (2) 




0.04 


1.83 


321 


0.03 


103 


had-184 


(736, 68080) 


2^-23 


2 


8544.87 


4287786 (6) 


33.34 


111862 (5) 


34.96 


125285 


10.13 


778 (2) 


7 


0.12 


6.11 


356 


0-08 


143 


had-232 


(928, 108112) 


2^-29 


2 


29352.07 


8592888 (6) 


80.48 


183720 (5) 


84.51 


199679 


26.41 


968 (2) 


7 


0.21 


14.99 


438 


0.04 


179 


had- 236 


(944, 111864) 


2 


472 




Timed out 


23322.51 


104043906 (3) 


23150.37 


104043906 


7599.66 


222787 (2) 


1 


16.99 


103.36 


1888 


0.16 


941 


had-sw-32-1 


(128, 2112) 


22 


42 


36.53 


1149410 (5) 


6.06 


192369 (4) 


4.64 


141832 


0.59 


2251 (2) 




0.05 


0.08 


168 


0.02 


81 


had-sw-88 


(352, 15664) 


22 


132 


17633.55 


80502284 (6) 


309.90 


3173883 (3) 


322.61 


3645512 


40.90 


19452 (2) 


— — 

2 


0.71 


2.53 


663 


0.01 


305 


had-sw-112 


(448, 25312) 


2 


132 




Timed out 


939.28 


10990338 (3) 


925.85 


10990338 


240.79 


50179 (2) 


1 


1.42 


7.12 


896 


0.01 


445 


mz-aug2-18 


(432, 684) 


238 


252 


106.08 


5374331 (38) 


26.30 


1048954 (37) 


22.64 


1048954 


1.21 


145 (19) 


38 


0.41 


1.22 


145 


0.50 


2 


mz-au;;2-2II 


( IM* 7GI.)) 


,-,12 .,^(, 




1 


110 i7r> 1 r 111 


lllKl 1 


110 17G1 


1 -il 


ini r2i) 


12 


0.G2 


i.'>n ini n.ni 2 


mz-iuig2-22 


(r>2rt, 


i(i 




2r>()(),(ji 1 


)27G1)!J!J9 {!(>) 






439. 




2. 10 


177 (23) 


-IG 


0,.sl 


2.()(i 


177 


0.81) 


2 


' ' ''''' ^^^-''^ 




mz-aug2-30 


(720, 1140) 


262 


420 














5.48 


241 (31) 


62 


2.24 


5.47 


241 


2.25 


2 


mz-aug2-50 


(1200, 1900) 


2IO2 


420 














25.53 


401 (51) 


102 


9.39 


25.44 


401 


9.32 


2 


pp-16-1 


(546, 4641) 


> 3.42el0 


1 


0.06 


lAA (6) 


0.02 


133 (5) 


0.02 


144 


0.09 


16 (4) 


6 


0.02 


0.09 


16 


0.03 


1 


pp-16-2 


(546, 4641) 


28.3-5 


10 


42288.08 


80597650 (5) 


23886.33 


1994354150 (5) 


1868.28 


80597650 


50.74 


11907 (3) 


5 


1.54 


2.51 


646 


0.09 


223 


pp-16-4 


(546, 4641) 


2^2.3 


6 


5267.15 


10311534 (5) 


3624-68 


301374561 (5) 


282.26 


10311534 


15-78 


3877 (3) 


3 


0.55 


2.77 


622 


0-07 


311 




(546, 4641) 


2II.32 


5 




Timed out 


17769-12 


901487681 (5) 


18438.75 


968486421 


19.94 


4864 (3) 


5 


0.97 


0.96 


304 


0-06 


84 


pp-16-7 


(546, 4641) 


2I4-32 


3 


Timed out 


370-87 


16177794 (5) 


4662.65 


210159039 


1.63 


402 (3) 


8 


0.19 


0.38 


106 


0.04 


24 


PP-16-S 


(546, 4641) 


2 15 , 33 


3 


683.93 


1706149 (5) 


197.71 


16589102 (5) 


23.58 


1170686 


0.42 


139 (3) 


8 


0.10 


0.22 


70 


0-05 


14 


pp-16-9 


(546, 4641) 


2-'-2 .32 .52 


6 


15598.57 


35144658 (5) 


3575.12 


303816170 (5) 


825.15 


35107649 


0.39 


187 (3) 


4 


0.07 


0.24 


118 


0-04 


32 


pp-16-11 


(546, 4641) 


2I2 .32 .7 


6 




Timed out 


916-16 


45141041 (5) 


4276.61 


219395299 


1.07 


379 (3) 


6 


0.15 


0.35 


132 


0-02 


32 


pp-16-15 


(546, 4641) 


2II-33 


8 




Timed out 


1929-39 


145623212 (5) 


3515.83 


278419116 


3.88 


1092 (3) 


6 


O.IS 


0.46 


193 


0-05 


70 


pp-16-17 


(546, 4641) 


2^^ -32 -5 


8 


7032.42 


14074851 (5) 


2576.56 


216934489 (5) 


343.41 


14074851 


2.37 


691 (3) 


7 


0.18 


0.46 


193 


0.05 


55 


pp-16-19 


(546, 4641) 


2^ 32 


14 




Timed out 


8268.57 


666219267 (5) 


3792.12 


130609514 


78.80 


18877 (3) 


5 


2.93 


3.50 


858 


0.15 


355 


pp- 16-21 


(546, 4641) 


2''' -3^ 


12 




Timed out 


10249.02 


617020245 (5) 


25871.22 


1622520018 


53.41 


12895 (3) 


4 


1.71 


2.79 


681 


0.14 


300 


flag- 6 


(1514, 21196) 


2^-3'^-7 


4 




Timed out 


16396.88 


749371122 (5) 


26665.97 


706556032 


584.45 


12304 (3) 


7 


5.85 


40.45 


1385 


0.68 


67 


sts-67 


(737,35376) 


3 


253 


8.91 


23841 (3) 


8.04 


157566 (2) 


7.77 


157566 


1.67 


255 (1) 


1 


0.02 


2.12 


255 


0.01 


253 


sts-sw-21-10 


(70,945) 


1 


70 


0.03 


1961 (3) 


0.03 


3011 (2) 


0.03 


3011 


0.00 


71 (0) 







0.00 


71 




70 


sts-sw-55-1 


(495,19305) 


1 


495 


7.76 


39106 (3) 


6.72 


206416 (2) 


6.57 


206416 


1.75 


496 (1) 







2.12 


496 




495 


sts-sw- 79-11 


(1027,58539) 


1 


1027 


73.09 


118106 (3) 


50.74 


937652 (2) 


49.80 


937652 


17.01 


1028 (1) 







19.03 


1028 




1027 



Table 1: Graphs with large search tree 



Graph 




|Aut| 


Orb 


nauty (2.2) - can 


bliss (0.35) - can 




Traces - can 






Time 


Refine (i) 


Time 


Refine {£) 


Time 


MRef (ind) 


Gns 


Grp 


ag2-16 


(528,4352) 


214.32.52.17 


2 


0.03 


82 (5) 


0.02 


100 (4) 


0.08 


12 (3) 


6 


0.02 


ag2-49 


(4851,120050) 


210.32.52.76 


2 


1.58 


52 (5) 


0.48 


51 (4) 


44.63 


12 (3) 


5 


0.14 


cfi-20 


(200, 300) 


2" 


80 


0.03 


146 (10) 


0.01 


262 (9) 


0.03 


28 (5) 


11 


0.01 


cfi-80 


(800, 1200) 


241 


320 


3.81 


1125 (36) 


0.26 


2802 (36) 


3.05 


118 (20) 


41 


1.19 


cfi-200 


(2000, 3000) 


2101 


800 


117.07 


5597 (90) 


5.30 


33893 (92) 


67.65 


298 (50) 


101 


37.13 


difp-20-0 


(8965,23082) 


1 


8965 


0.02 


1 (1) 


0.01 


1 (0) 


0.61 


1 (0) 





- 


fpga-10-8 


(688,1320) 


223.3 


519 


0.05 


300 (24) 


0.01 


300 (23) 


0.52 


71 (16) 


21 


0.16 


fpga-13-11 


(1500.3125) 


232.37 


1116 


O..39 


780 (39) 


0.02 


39 (38) 


4.09 


112 (26) 


31 


1.27 


griil-:-!-2() 


(8000,22800) 




220 


2.27 


5 (2) 


0.05 


5 ll) 


19. 19 


1 (0) 




0.08 


grid-w-2-100 


(10000,20000) 


2^.54 


1 


18.33 


7 (3) 


0.09 


8 (2) 


850.35 


3 (1) 


5 


0.23 


grid-w-3-20 


(8000,24000) 


210.3-53 


1 


4.22 


8 (3) 


0.08 


8 (2) 


144.29 


3 (1) 


6 


0.25 


k-70 


(70,2415) 


> 1.97el00 


1 


0.01 


2845 (70) 


0.02 


2485 (69) 


3.08 


148 (68) 


8 


3.04 


k-100 


(100,4950) 


> 9.33el57 


1 


0.01 


5050 (100) 


0.05 


5050 (99) 


26.33 


208 (98) 


8 


26.25 


latin-30 


(900,39150) 


26.33.52 


1 


0.17 


60 (5) 


0.18 


99 (3) 


0.27 


3 (1) 


5 


0.02 


latin-sw-21-8 


(441,13230) 


1 


441 


5.44 


26902 (3) 


6.43 


168022 (2) 


1.23 


442 (1) 





- 


latin-sw-30-11 


(900,39150) 


1 


900 


49.79 


79255 (4) 


46.32 


731701 (2) 


12.43 


901 (1) 





- 


lattice-30 


(900,26100) 


> 1.40e65 


1 


0.30 


930 (31) 


0.40 


901 (30) 


9.90 


122 (28) 


33 


8.18 


mz-18 


(360, 540) 


239 


90 


0.20 


722 (21) 


0.01 


631 (20) 


0.64 


76 (18) 


24 


0.18 


mz-50 


(1000, 1500) 


2 


250 


> 24000 


Timed out**' 


0.30 


8653 (52) 


24.76 


204 (50) 


56 


8.09 


mz-aug-22 


(440, 1012) 


247 


110 


0.24 


845 (25) 


0.03 


778 (24) 


1.46 


139 (23) 


28 


0.60 


mz-aug-50 


(1000, 2300) 


2103 


250 


> 24000 


Timed out^**) 


> 24000 


Timed out^**** 


24.51 


307 (51) 


56 


14.32 


I)alcA'-161 


(161,53015) 


2.5.23.161 


1 


0.01 


6 (3) 


0.07 


6 (2) 


0.52 


3 (1) 


2 


0.00 


pg2-:-!2 


(2111,:-! 1881) 


> 1.09el:-i 


1 


1.70 


1)12 (0) 


0.21 


8.->8 (5) 


1.10 


11 (0) 


() 


0.13 


rnd-3-icg-3000-l 


(3000,4500) 


1 


3000 


328.57 


3001 (2) 


0.39 


3001 (1) 


0.31 


1 (0) 







rnd-3-reg-lOOOO-l 


(10000,15000) 


1 


10000 


42821.01 


10001 (2) 


5.60 


10001 (1) 


4.24 


1 (0) 







S3-3-3-3 


(11076,20218) 


2I2 


7836 


130.06 


91 (13) 


0.10 


91 (12) 


2053.71 


43 (11) 


12 


1.28 


urq8-5 


(3906,20331) 


1 


3906 


0.00 


1 (1) 


0.01 


1 (0) 


0.18 


1 (0) 








(*'9.68 sees, 3249 (52) nodes in nauty-aut; (**)3.34 sees, 3004 (53) nodes in nauty-aut; (***'0.14 sees, 3005 (53) nodes in bliss-aut; 



Table 2: Graphs with small search tree 



5 Concluding remarks and further work 



Starting from a critical investigation of the individualization-refinement technique for canonical 
labeling of graphs, a new algorithm has been presented with the aim of reducing the associated 
search space. The main novelties of the proposed method can be found (i) in the refinement 
mechanism, (ii) in the choice of individualization steps, (iii) in the procedure for computing 
the automorphism group of a graph, (iv) in the possibility of comparing refinements without 
computing them completely. 

Experimental results have been provided showing a significant improvement of performances 
in the case of graphs of combinatorial origin which are considered as critical for any algorithm 
adopting the individualization-refinement method. The comparison between tables of results 
presented in the paper leaves an interesting scenario and several directions for further work. 
In effect, the new approach and the state of the art ones are in some sense complementary, 
which hints to their possible integration. As an example. Traces and nauty (or bliss) could 
exchange information about detected automorphism of the input graph in order to obtain the 
best performances from one another. 

At the implementation level, Traces is still at an initial stage; some engineering work should 
be done in the same spirit of bliss wrt nauty, with a special attention to the multi-refinement 
procedure and to its possible parallel implementation. 

From a theoretical standpoint, the main achievement of this paper is that Traces does not 
have an exponential behavior on any of the benchmark families of graphs considered in the 
literature for the individualization-refinement method. In our opinion, this is due to a better 
choice of the target cell and to the ability of decreasing the depth of the search space; it is our 
intention to investigate further such aspects of the proposed algorithm. 
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